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Abstract 

In this article, we generalize the Wasserstein distance to measures with different masses. We study the 
properties of such distance. In particular, we show that it metrizes weak convergence for tight sequences. 

We use this generalized Wasserstein distance to study a transport equation with source, in which both 
the vector field and the source depend on the measure itself. We prove existence and uniqueness of the 
. solution to the Cauchy problem when the vector field and the source are Lipschitzian with respect to the 

' generalized Wasserstein distance. 
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The problem of optimal transportation, also called Monge-Kantorovich problem, has been intensively 
studied in the mathematical community. Related to such problem, the definition of the Wasserstein distance 
£SJ ' in the space of probability measure has revealed itself to be a powerful tool, in particular for dealing with 

dynamics of measures (like the transport PDE, see e.g. [2]). For a complete introduction to Wasserstein 
| distances, see [15, 16]. 

The main limit of this approach, at least for its application to dynamics of measures, is that the Wasser- 
stein distance W p (fi, v) is defined only if the two measures fi, v have the same mass. For this reason, in this 
article we first define a generalized Wasserstein distance Wp' b (fi, v), combining the standard Wasserstein 
and L 1 distances. In rough words, for W£ ' b (//, v) an infinitesimal mass S/j, of fj, (or v) can either be removed 
at cost a|<fyt|, or moved from fj, to v at cost bW p (8fi, 5v). 

After proving simple but important properties of this distance, we show that it is the natural distance to 



i — \ 

■ state existence and uniqueness of the solution to the following dynamics of measures: 

id t fi + V ■ (v [fi] n) = h\n\ , ^ 

V| t=0 = Mo- 
Such equation is intensively used for modelling of crowd dynamics, where Mt represents a pedestrian density 
(see e.g. [5, 7, 9, 10, 11, 12, 13, 14]). Several authors have studied (1) without source terms, i.e. h = 0, 
showing that it is very convenient to use the standard Wasserstein distance in this framework. In particular, 
in [1] the authors prove existence and uniqueness of the solution under Lipschitzianity of v with respect to 
M, when the space of measures (of given fixed mass) is endowed with the Wasserstein distance. See also [4]. 
We also showed that these hypotheses give convergence of numerical schemes [11]. 
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The main limit of the approach based on the standard Wasserstein distance is that it cannot encompass 
the case of a source h. Indeed, in this case the mass of the measure /i t varies in time, hence in general 
W p (/j,t, Us) is n °t defined for t ^ s. Sources (and sinks) are nevertheless very interesting for models of 
pedestrian, for instance in the case of people entering or exiting a door. It is interesting to recall that the L 1 
distance (that one could try to use, since it is defined even between two measures with different masses) is 
not suitable in this context, since Lipschitzianity of v with respect to fi measured in L 1 does not guarantee 
uniqueness (see [11]). 

In this article, to deal with a source in (1), we focus on the space of Borel measures with finite mass on 
R d (denoted with M), that we endow with the generalized Wasserstein distance W£' b . We also denote with 
A4q c the subspace of M. of measures that are absolutely continuous with respect to the Lebesgue measure 
and with bounded support. In this framework, we prove existence and uniqueness of the solution of (1) with 
Ho € Mq c under the following hypotheses: 



(H) 

The function 



satisfies 



M -> C 1 (K d )nL c 



• v \p] is uniformly Lipschitz and uniformly bounded, i.e. there exist L, M not depending on 
/j,, such that for all fi G M, x, y G M. d , 

\v [ M ] (x) - v [fj] (y)\ < L\x - y\ \v [ M ] (*)! < M. 

• v is a Lipschitz function, i.e. there exists N such that 

ll«M-«Mllco < nw%* {n,v) . 

The function 

I m ^ Mm] 

satisfies 

• h [fj] has uniformly bounded mass and support, i.e. there exist P,R such that 

h \p] (R d ) < P, supp (h [fj]) C B R (0). 

• h is a Lipschitz function, i.e. there exists Q such that 

^•'(%].M"])<0^' 1 (^")- 



Remark 1 The hypotheses (H) can be relaxed in standard ways. For example, the results still hold if we 
remove the uniform boundedness of v on R d and ask for uniform boundedness only in a point x . See e.g. 
[3, Chap. 2]. 

Remark 2 The application to pedestrian dynamics also explains the choice of the basic assumptions (H), 
namely that we deal with measures with bounded support. 



The structure of the paper is the following. In Section 1, we define the generalized Wasserstein distance 

p ' 



Wp' b and we prove some important properties related to that. In particular, this distance metrizes the weak 
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topology for tight sequences. Moreover, M. is complete with respect to such distance. We also compare W£ ' b 
with other distances, like the Levy-Prokhorov distance. We then restrict ourselves to the study of W^ b in 
Mq c and provide Gronwall-like estimates under flow action. 

In Section 2, we describe the complete picture for (1) under (H). We first provide a candidate solution 
for (1) via a semi-discrete Lagrangian scheme and using the sample-and-hold method. We then show that it 
is indeed a solution, and finally that it is unique. 

1 Generalized Wasserstein distance 

In this section, we define the generalized Wasserstein distance W£ ' b (/j,, v) that we study in this article, 
and prove some useful properties. We first recall basic definitions and notations about measure theory and 
Wasserstein distance. For a complete introduction, see [6, 16]. 

1.1 Notation and standard Wasserstein distance 

In this section, we fix the notation that we use throughout the paper, and recall definitions and properties 
related to measure theory and the Wasserstein distance, like push-forward of measures 7#/x and transference 
plans. 

Let \i be a positive Borel measure with locally finite mass. If {i\ is absolutely continuous with respect 
to /i, we write /ii <C [i. If /ii <C /i and Hi(A) < fJ,(A) for all Borel sets, we write ji\ < fi. Given a measure 
with finite mass, we denote with := ^i(R d ) its norm. More in general, if fj, = fi + — [i~ is a signed Borel 
measure, we have := |yu + | + | | . Such norm defines a distance in M, that is |/x — v\. It is useful to recall 
that, if /Lti <C /i and d[i\ = f dfi with / G L l (d[i), then \jj,i \ — j \ f\ djj,. 

Given two measures jj,, v, one can always write in a unique way /i = fi ac + jjb s such that jj, ac <C v and 
Us _L v, i.e. there exists B such that fJ, s {B) = and i/(M. n \ B) = 0. This is the Lebesgue's decomposition 
Theorem. Then, it exists a unique / € L l {dv) such that d/j, ac (x) = f(x) dv{x). Such function is called the 
Radon-Nikodym derivative of \i with respect to v. We denote it with D v \x. For more details, see e.g. [6]. 

Given a Borel map 7 : R d — > M. d , one can consider the following action on a measure fj, £ A4: 

7# M (A) := M ( 7 -V)). 

An evident property is that the mass of fi, i.e. /x(K d ) is identical to the mass of 7#/u. 

Given two measures /1, v with the same mass, it is thus possible to ask if there exists a 7 such that 
v = 7#M- We say that 7 sends fj, to v. Moreover, one can add a cost to such 7, given by I [7] := 
ImI -1 fg,d \ x ~ l( x )\ p dn{x). This means that each infinitesimal mass Sfj, is sent to 8v and that its infinitesimal 
cost is related to the p-th power of the distance between them. The, one can consider the map 7 minimizing 
such cost, if it exists. This is known as the Monge problem, stated by Monge in 1791. 

In general, this procedure works only for special fj,, v and p. Indeed, there exist simple examples of fi, v 
for which a 7 that sends fi to v does not exist. For example /j, = 25\, v = S + 82 on the real line have the 
same mass, but there exists no 7 with v — -f#[i, since 7 cannot separate masses. Moreover, one can have 
a sequence 7„ of maps such that / [7^] is a minimizing sequence, but the limit is not a map 7*. A simple 
condition that ensures the existence of a minimizing 7 is that \i and v are absolutely continuous with respect 
to the Lebesgue measure. 

For such reason, one can generalize the problem to the following setting. Given a probability measure 7r 
on R d x R d , one can interpret it as a method to transfer a measure /1 on R d to another measure on M. d as 
follows: each infinitesimal mass on a location x is sent to a location y with a probability given by n(x,y). 
Formally, [i is sent to v if the following properties hold: 

ImI / dn(x, •) = dn(x), \v\ / dn(-,y) = dv{y). (2) 
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Such 7r is called a transference plan from fj, to v. We denote the set of such transference plans as Tl(p, v) . Since 
one usually deals with probability measures n, u, the terms \v\ are usually neglected in the literature. 
A condition equivalent to (2) is that, for all f,g G C^°(M. d ) it holds J^d^difix) + div)) dn(x,y) = 
J Rd f(x) d/j,(x) + J Rd g(y) dv(y). Then, one can define a cost for it as follows J [n] :— f R d xR d \x — y\ p dn(x, y) 
and look for a minimizer of J in v). Such problem is called the Monge-Kantorovich problem. 

It is important to observe that such problem is a generalization of the Monge problem. Indeed, given a 
7 sending /i to v, one can define a transference plan 7r = (Id x 7)#/U, i.e. dir(x, y) = d[i(x)5 y=1 ( x y 
It also holds J [Id x 7] = / [7]. The main advantages of such approach are the following: first, the existence 
of at least one it satisfying (2) is easy to check, since one can choose tt(A x B) = ii(A)v(B), i.e. the 
mass from [i is proportionally split to v. Moreover, a minimizer of J in II(^, v) always exists. 

A natural space in which J is finite is the space of Borel measures with finite p-moment, that is 

M p := I a* G M I J \x\ p dfi{x) < 00 J . 

In the following, we also denote with V the space of probability measures, i.e. the measures in M with unit 
mass. We also deal with V p := MP fl V , i.e. the space of probability measures with finite p-moment. One 
can thus define on M p the following operator between measures of the same mass, called the Wasserstein 
distance: 

W p (p,v) = (\»\ min JM) 1 /*. 

7rGn(^,i/) 

It is indeed a distance on the subspace of measures in M p with a given mass, see [16]. It is easy to prove 
that W p (kfx,kv) — k 1 ^ p W p (fi,i l ) for k > 0, by observing that H(kfi,kv) = H(fi,v) and that J [n] does not 
depend on the mass. We will recall some other properties all along the paper, when useful. 



1.2 Definition of generalized Wasserstein distance 

We are now ready to define the generalized Wasserstein distance W p ,h (a*, v). We first give a rough 
description of the idea. Imagine to have three different admissible actions on a*, v: either add/remove mass 
to fi, or add/remove mass to v or transport mass from fj, to v. The three techniques have their cost: 
add/remove mass has a unitary cost a (in both cases); transport of mass has the classic Monge-Kantorovich 
cost J, multiplied by a fixed constant b. The distance is the minimal cost of a mix of such techniques. We 
will show in the following that, depending on fi, all mixes are possible: either remove all the mass of a* and 
v (if they are very far), or transport the whole /i to v (if they have the same mass and are close enough), 
or a mix of the two (for example when \i and v are very close but with different masses). Instead, we will 
prove that add mass is never optimal. 

Remark 3 The fact that the unitary cost of adding/removing mass is identical for the two terms a*, v is to 
ensure symmetry of Wp' b . 

We now formally define the generalized Wasserstein distance. 

Definition 4 Let M be the space of Borel measures with finite mass on M. d . Then, given a, b G (0, 00) and 
p > 1, the generalized Wasserstein distance is 

W^ b {n,v)= inf {a\n-ji\+a\v-u\ + bW p {Ji,u)). (3) 
A~t, £ G M p 

IAI = |*| 

Proposition 5 The operator W^ h is a distance. Moreover, one can restrict the computation in (3) to 
A* < Mi v < v an d the infimum is always attained. 
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To prove this Proposition, we need to recall some known concepts and results related to tightness and 
weak convergence. For more details, see [15, 16]. 

We first recall results about tightness. We always deal with measures on R d . 

Definition 6 A set of measures M is tight if for each e > there exists a compact K E such that fi(R d \K e ) < 
e for all [i £ M. 

Lemma 7 The following holds: 

1. A measure with finite mass is tight. 

2. A set of measures M such that all measures are bounded above by a tight measure fi is tight. 

3. The union of a finite number of tight sets is tight. In particular, a finite set of measures with finite 
mass is tight. 

Proof. The first result is proved by taking a sequence of invading compacts {K n }. Since yu(K d ) = 
\im n n{K n ) = E^iA 1 ^" \ K n - X ) < oo, then ^i{R d \ K l ) = £~ =i+1 li(K n \ K^ 1 ), as small as needed, 
since it is the remainder of a converging sequence. 

For the second property, for each e take K e that gives tightness of fi. Since \x £ M implies fi < jl, hence 
/i{R d \K e )<e. 

For the third result, take the sets Mi, . . . ,M n , each of them being tight. For each e > 0, there exists a 
corresponding compact K\ such that for each \l £ Mj satisfies ^(IR'' \ K % e ). Then, define K e — Uf =1 -fQ, that 
is compact because it is a finite union of compacts. Now take \i £ U™ =1 Mj and observe that fi £ Mi for some 
i, thus fi(R d \K e ) < fi(R d \K l £ ) < e. □ 

We now recall results about weak convergence and give some technical lemmas we use in the following. 

Theorem 8 (Prokhorov's theorem) Let X be a Polish space. A set P in the space of probabilities V{X) 
is precompact for the weak topology if and only if it is tight. 

We recall that R d is a Polish space (see a definition in [16]), thus Prokhorov's theorem can be applied in our 
setting. 

Theorem 9 (Weak compactness [6]) Let [i n be a sequence of Radon measures in M. d with uniformly 
bounded mass 1 , i.e. there exists M such that < M for all k. Then there exists a subsequence [i nj and a 
Radon measure n* such that /x" J — ^ n* . 

From now on, we denote with the asterisk n*,c*,... the limit (or the weak limit) of a sequence /x„, c n ,... 

Lemma 10 Let /i™, v n be two sequences of B or el measures such that /U™(R ) = v n (M. ) for each n. For each 
n, let 7r™ be a transference plan with marginals fj, n ,v n . If we have fj, n — fj,* , v n — ^ v* , 7r™ — it* for some 
yU* , v* , 7r* , then 7r* is a transference plan with marginals fM*,v*. 

Proof. The weak convergence of n n means that, for each / £ C^°(R d x R d ) we have J f(x,y)dn n (x,y) — > 
J f(x, y) dn*(x, y). In particular, choose f(x, y) = g(x) and observe that 

J g{x)dn*(x,y) <- J g(x) dir n {x, y) = j g{x) dfi n (x) -»• J g{x) dfi* {x). 

By uniqueness of the limit, we have the proof for the marginal n*. Using f(x, y) = h(y), we have the same 
for v*. □ 
We are now ready to prove Proposition 5. 

Proof of Proposition 5. The symmetry property W p ° :b (fj,, v) = Wp ,b {y, jj) is evident. 



x The result in [6] is stated with a weaker condition, that is uniformly boundedness of mass on each compact. 
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We first prove that we can always restrict to p < \i. Define 

C(p, v) := a\fi — p\ + a\y — v\ + bW p (p, v). 

First assume that the infimum of C is attained by p \x and a certain v. Let 7r e n(/i, v) be the transference 
realizing W p (p, v). Let d be the Radon-Nikodym derivative / = D^p and := p — f/i the orthogonal of 
p with respect to [i. Define p := min {/, 1} and v the image of /I under ir. Since p < p and 7r G n(/i, £), 
then v <v. Moreover, \v — v\ = \p — p\, since \p\ = \t>\, \p\ = \i>\ by construction. Observe that 

\H-P\ = f\l-f\d^ + ^(R d )= f (l_/)d M +/ (/-l)d M + ^(K d ) = 

•/ Jl-f>0 Jl-f<0 



Hence 



{dn -dp)+[ (dp - dp) - > 1}) ) + /xx(K ) >\fi-p\ + \p-p\. 

f<i \Jf>i 



\fi - n\ + \v - v\ < \n - p\ - \fi - p\ + \v - i>\ + \i> - v\ = \fi - p\ + \v - v\. (4) 



The fact that W p (fi, v) < W p (p, v) is a direct consequence of the fact that 7r € H(p,i>) can be restricted 
to 7r' G H(p,P), by construction of v. Moreover, the cost of tt' is smaller 2 that the cost of ir. Using this 
inequality and (4), we have that C(p, D) < C(p, v). Since p < fi, then the result is proven. 

If the infimum is not attained, consider a minimizing sequence p n and construct each p n := min {D^p n , 1} fi, 
that gives another minimizing sequence with p n < [m. By symmetry, the same property can be proved for 
the term v. 

We now prove that the infimum in (3) is always attained. To prove it, we restrict ourselves to 
P < < v. Take a sequence p n ,v n such that C(p n 1 v n ) — > W p ' b (p,v) and p n < fx, v n < v. Since both 
/j,, v G Ai have finite mass, then {p n }, {v n } have both uniformly bounded masses. Thus, due to Theorem 
9, passing to sub-sequences we have p n — yu*, v n — ^ u* . We now prove that C(^i*,^*) < lim„ C(p n , v n ) = 
W p ' b (n,y)- First recall that weak convergence gives | \i — n* \ < liminf„ — p n \ and equivalently for \v — v *\. 
We are left to prove that 

W p {n*,v*) <YimW p (p?\D n ). (5) 

n 

If ii* = v* =0, then we are done. Otherwise, the sequence c„ := \p n \ = \v n \ (eventually passing to a 
sub-sequence) converges to c* > 0. For n such that c n ^ 0, define the probability measures p n = c~ p n , 
v n = c^v"" 1 . It is clear that p n p* = (c*) _1 /i*, and similarly for v n . Denote with 7r™ the optimal 
transference plan in Ii(p n ,v n ) — H(p n ,D n ). Since p n < su ^ c fi and u n < su ^ c v, then M := {p n }, 
N := {D n } are both tight, hence the set of transference plans H(M,N) is tight (see e.g. [15, Lemma 4.4]). 
Hence, due to Prokhorov's theorem, up to sub-sequences we have ir n — tt* for some n* . Using Lemma 10, 
we have that tt* is a transference plan with marginals p* and v* (not necessarily optimal). Since the distance 
is non-negative, then the functional J : n — >• J \x — y\ p dTr(x, y) is lower semicontinuous with respect to the 
weak topology, see [15, Lemma 4.3], thus 

Wp(^V*) = (c*) 1 / p WJp\v*) < (c*) 1 / p J( 7 r*) 1 / p < lim C y p J(7r") 1 / p = lim C y p IU p (^™,P™) = limIT„(/i n ,z> n ). 

n n n 

We now prove that W p b (/U, v) = implies /j, = v. Since the infimum is attained for some p, v, then 
\fj, — p\ = W p (p, v) = \v — v\ = implies n = p = v = v. 

We now prove triangle inequality W° ,b (/x, 77) < ,b (yu, v) + W^- b (v, rj). Denote with p, v 1 the min- 
imizers in (3) for W p ,b (fi, v) and v 2 ,p the minimizers for W p ' b (v, rj). Observe that we have v 1 ^ z> 2 in 
general. Also call ir 1 ,^ 2 the transference plans realizing W p (p, v 1 ), W p (v 2 , rj), respectively. Define now 
v := min {D^D 1 , 1} j> 2 - Define p, fj the marginals of D with respect to tt 1 ,^ 2 respectively, i.e. tt 1 G H(p,D) 



2 Theorem 4.6 in [15] also shows that tt' is optimal, but this is not crucial here. 
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and 7r 2 G n(f',77). Thus we have tt 2 o n 1 G 77). By construction, we have \fi — p,\ = \u — v\. Moreover, 
W p (fi, v) < W p (jl, v 1 ), because we use the same transference plan n 1 with a smaller mass. Similar properties 
hold for 7), 77. We thus have 

W^i^rj) < a \ti-fi\+a\ri-fj\ + bW p {ji,fi)< 

< a\fi - + a\fi a\r) - fj\ + a\f] - fj\ + bW p (fi, v) + bW p (v, fj) < 

< a\fi - fj\ + alD 1 - v\ +a\r]- rj\ + a\v 2 - v\ + bWpifl^ 1 ) + bW p {v 2 ,f]). (6) 

Define / := D^v 1 and v\ := v 1 — fv 2 the orthogonal part of v 1 with respect to v 2 . Observe that both 
v 1 ,v 2 < v, hence 73^2^ > max{/, 1} and is± := v — (D^2v)h> 2 satisfies is± > v\. Then one has 

/(/-min{/,l})di> 2 + z>i(K< i ) = / {f-l)dv 2 +( di> 2 + i>{(R d ) = 
J Jf>i Jf<i 

[ (m&x{f,l} -l)dD 2 + D{(R d ) < [ {du - dv 2 ) + ui_{Wl d ) = \v - v\ 
Jf>i Jf>i 

and similarly \v 2 — v\ < \v — v 1 \. Plugging them into (6), one has the proof. □ 
One interesting feature of this distance is that the | • | term and the Wassertein term W p have different 
degree of homogeneity with respect to translation in W 1 , thus the optimal strategy for W p ' h varies when 
translating one measure. For example, compute W p ' b (So, S x ) as a function of x > 0. We have \6q — S x \ =2 
and ^(Jo, 5 X ) — x. Hence W p ,b (So,S x ) = min {2a, bx}. If 2a < bx, i.e. measures are "far", then the optimal 
strategy is to delete both masses Sq and 6 X , otherwise it is optimal to move 8q to S x with a translation. 

Another simple example permits us to show that optimal strategies can be either based on removing mass 
only (L 1 strategy), or on transporting mass only (W p strategy), or by a mix of them. We give an example in 
Figure 1. Take the measures /U, v on the real line with densities d/i = X[-i,o] d\ dv = X[x,i+x] with x > 0, 
where A is the Lebesgue measure. It is clear that the optimal strategy amounts to choose jl, v with densities 
X[-i/,oi; X[x,x+y] respectively, for a certain parameter y G [0, 1]. The L\ strategy is given by choosing y = 0, 
while the W p strategy is given by y — 1. We now prove that all values of y can be optimal, depending on 
x and the parameters a, b, p. We have — jl\ = \v — v\ = 1 — y and W p (jl, v) — \y\ x ^ p (x + y). We choose 
for simplicity a = b = p = 1. Thus W p ' b (fi, v) — min^^i] 2 — 2y + xy + y 2 . A simple computation shows 
that the minimum is attained by y — if x G [0, 2], and y = for x > 2. This clearly shows that if the 
measures are very close (x = 0), then the best strategy is the W p (since y = 1), while for measures that are 
far (x big) the best strategy is the L 1 . As stated above, varying x G (0, 2) one has mixed strategies. 




mass transportation 



-1 -y x x + y i + x 

Figure 1: Choice of jl, v (shaded) for the computation of W p ' b . 



We now state some simple properties of W p - b . 
Proposition 11 The following properties hold: 

• W^ b [kn, kv) < max {fc 1 ^, k} W^ b (p, v) for k>0, 
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• a 



W^ b (mi + H2,ui + V2) < W*- b ( M i,^i) + W^ b (p 2 , V2). 
\n\-\v\ < W«> b {ii,v) <o(H + M) 



Proof. The first two properties are direct consequences of similar properties for | • | and W p . 



For the third, we first prove the inequality a 



< Wp' b d-ijv). Without loss of generality, we 



IMI - \v\ 

assume \fi\ > \v\. Take any fi < fj,, P < v and observe that \\i — jl\ = \fj,\ — \jl\, and similarly for v, P. Also 
recall that \fx\ = \P\ < \v\ by construction. Now choose jl < fi, P < v realizing W° ' b (fi, v) and observe that 

{n, v) > a\n -jl\ + a\u-P\=a (|m| - |A| + M - \f>\) > a (| M | - H + 0) . 

We now prove the inequality W®' b (pi,v) < a(|/i| + Choose p, = v = and observe that C(0, 0) = 

a(\fj,\ + Since W£ ' b is the infimum on all ft, v, we have the inequality. □ 

1.3 Topology of the generalized Wasserstein distance 

In this section, we prove that W£ ,b metrizes weak convergence for tight sequences. We also prove that M. d is 
complete with respect to Wp- b . 

We first prove a simple lemma for Wp ,b , stating that optimal choices jl, v are very close to each other in 
R d . The basic idea is that, if we want to transfer far mass between jl and v, then it is cheaper to remove 
such masses from both measures. 

Proposition 12 Let fj,,v G M and jl, v be the choices realizing W^ b (/i, v). If jl has support contained in a 
compact K, then v has bounded support, contained in the enlarged compact 

K d :=U xeK B(x,d) (7) 

with d = ^f. Here we denote with B(x, r) the closed ball centered in x with radius r. 

Similarly, let jl, v be the choices realizing W^ ,b (fi, v), and tt the transference plan realizing W p (jl, v). Let 
jj! < ft have support contained in a compact K . Then the corresponding marginal v' with respect to tt has 
compact support contained in K d . 

Proof. We prove the second statement, since the first can be recovered by choosing fi' = jl. 

With the notations given in the statement, we prove that the support of v' is contained in K d , by 
contradiction. Assume that there exists d! > d and a Borel set D C M. d \ K d such that v'{D) > 0. Thus 
define v := kv[ with k < 1 chosen in such a way that v(D) < 1. Observe that v < v 1 < v. Define 
v* := v — v < v and (x* the corresponding marginal given by tt. We now prove that the choice n* , v* in (3) 
gives a cost that is strictly less than the optimal choice jl, v. First observe that 

\fi — fi*\ + \v — v*\ = \fi - jl\ + \jl - + \v — v\ + \u - u*\ = 

= \fj,-p,\ + \v-i>\ + 2\D-i/*\ = \iJ,-tj,\ + \i/-D\ + 2\D\. (8) 

Now observe that W£(p,, v) = Wg(fi*, v*) + W^{fl — n*,v — v*), by construction of fi* , v* via the optimal 
transference plan ir. Observe that supp(/i — C supp(^i') C K and that supp(i> — v*) = supp(9) CDC 
R d \ K d . In particular, if x E supp(/x — fJb*),y E supp(i> — v*), then \x — y\ > d' . Thus, given the optimal 
transference plan tt' E H(jl — fi*, v — v*), we have 

Wl{f± - ,v - v*) = J \x-y\ p dTr'(x,y) > J d' p d(v - v*) = \D\d' p . (9) 

Putting together (8) and (9), and using \v\ 1 / p > \p\ since \p\ < 1, we have 
C{ti*,v*) < a\n - jl\ +a\v - v\ +2a\p\ + bW p (fl,P) - bW p (fi - fi*,v - v*) < 

< a\n -jl\ + a\v -P\ + bWpifi, v) + 2a\P\ - bd'\P\ 1/p < C(fi, P) - b [d! - 2^ \p\ < C(jl, P). 
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Thus fi, v is not optimal. Contradiction. Thus supp(V) C K for all d! > d, hence supp(i/) C K . □ 

V a 



We now prove the following convergence theorem, stating that W p ' b metrizes weak convergence for tight 



sequences. 

Theorem 13 Let {^i n } be a sequence of measures in R d , and /U„,/i G M. Then 

Wp b (/i„,/i) — >• is equivalent to /i n — ^ /i and is tight. 

Proof. Fix the following notation: for each n, let be optimal choices in (3) for W^ b (/i„, with 

Mn < Mm v n < /i, and 7r n the transference plan realizing W p {jl n , v n ). 

We first prove Fix e > 0. Let TV be such that W p ' b {fi n ,^) < £ for all n > N. Since {/i„} is tight, 
then the set M := U {fi} is tight too. Given 6 > 0, consider the corresponding Kg giving tightness of 
M. By definition, all m e M satisfy m(R d \ Kg) < S. Due to weak convergence of [i n to fi, we also have 

/J, n (Ks) — > ^{Kg). Choose N' such that \/j, n (Kg)\ — \fi(Kg)\ < e for all n > N' . It means that there exists 

v n (positive or negative, supported in Kg) such that (fj, n + v n )(Kg) = fJ,(Kg). Define fi n := (fi n + u n )\ K and 
fl := /U| K . It is clear that v n — 0, hence fi n — ^ /2. Since has bounded diameter and W p metrizes weak 
convergence in bounded spaces (see [16, 7.12], recalled below in Theorem 14), then W p {jl n ,jl) — > 0. Take 
N > N' such that W p (fj, n ,fi) < S and \v n \ < 5 for all n> N. We now estimate 

W£' b (/j, n , fx) < C(/2„, fi) = a\fj, n - p, n \ + a\fi - fb\ + bW p (p, n , fi) < 

< a\fi n (R d \K S )\+ a\v n \ + a\fi(R d \Kg)\+bS< (3a + b)S. 

Choose 5 = ej (3a + 6) and have the result. 

We now prove =>. As a first, main step, we prove that W p ' b (fJ, n ,fJ-) ~ > implies that {fi n } is tight. Fix 
now e > 0. There exists a corresponding N such that W p ,h (/i„, /i) < e for all n > N, and a compact -fCj such 
that [i(R d \ K £ ) < e, since yu is tight. Define v' n := v n restricted to K E , and /j,' n the corresponding measure 
given by the transference plan 7r„ realizing W p (ji n , v n ). Using Proposition 12, we have that the support of 
[i' n is contained in K d with d — 2f. Observe that such set does not depend on n. By construction, we have 

|/in - y! n \ = - Kl - Kl - K\ = ^(R" \ # e ) < S. 

We now estimate 

fi n (R d \K d ) < \^ n -fi n \+fi n (R d )~^4K d )<^W^ b ^ n}f i) + 1^1-1^1 < £ ~+e. 

This is the tightness of {Ai„} n>Jv . Observe now that the finite set of measures {/ii, . . . , with finite masses 
is tight. Since a finite union of tight sets is tight, we have that {/i n } is tight. 

We now observe that |/x n | < + ^^(/in,^) by Proposition 11, that is a converging sequence of real 
numbers, thus fi n have uniformly bounded mass. We now apply Theorem 9, that gives weak convergence 
of \i n to a certain \i* . We prove that /i = /j*. Using the implication we have W p ' b (fi n ,fi*) — > 0. Thus 
W^ b (ft, n*) < lim„ W;- b {ft, M „) + ( M „, M*) = 0, hence \i = fx* . ' □ 

It is interesting to compare such result with a similar result for the standard Wasserstein distance, that 
we recall here. 

Theorem 14 [16, 7.12] Let \ik be a sequence of probability measures in V p , and fief. Then the following 
statements are equivalent: 

• W p (fj, k ,n) ->■ 0; 

• yUfc — ^ /j, and the following condition holds 



lim limsup / \x\ p d/j,k(x) — 0; (10) 

R ^°° k J\x\>R 
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• Hk — 1 A* arl ^ the p-moment converges, i.e. 

J \x\Pdfi k (x) -> y |a;| p d/i(a;). (11) 

Condition (10) is called "tightness" condition, a notation that could create some confusion with respect 
to Definition 6. Anyway, we remark that condition (10) is stronger than Definition 6, thus our Theorem 13 
applies on a wider class than Theorem 14. First of all, Theorem 14 applies for measures that have all the 
same mass, otherwise W p ([i,k> I*) is n °t defined. Moreover, even taking a sequence of probability measures 
/Ufe, one can have convergence in W£' and no convergence in W p . This occurs exactly in the case in which 
{/Xfe} is tight according to Definition 6 and not according to condition (10). For example, take the following 
sequence of probability measures: fik '■= (1 — k~ p )5o + k~ p 5k- It is clear that such sequence converges weakly 
to p* = S , and that W p (fj, k , »*) = W p {k~ p 5 ,k~ p 5 k ) = fc" 1 j k p S = k^ 1 -fr 0. Indeed, condition (11) 
is not satisfied, since the p-moment of ^* is 0, while the p-moment of [i k is 1. Similarly, condition (10) is 
not satisfied, since for each R the measures fi k with k > R satisfy f\ x \ >R dfik = ^ v !\ x \yR^ P ^k = 1^0. 

Instead, estimate W£ ,b (/ifc, //*) by choosing fik = — k~ p 5k, p,* = (J,* — k~ p 5o, that gives W£' b (pk, M*) = 
a\k~ p Sk\ + a|fc~ p <5o| = 2k~ p — > 0. Thus, {^k} is tight according to Definition 6, as it is easy to prove by 
observing that /Zfc(R \ [— n,n}) < n~ p . 

It is clear that, on the contrary, if {/x^} satisfies (10), then it satisfies Definition 6. Indeed, let {/i/t} 
satisfy (10). For each e > there exist R, m such that J^ >R \x\ p d[ik < £ for all k > m. One can always 
assume R > 1. Thus /ifc(R n \ -6(0, R)) < e for all fc > to, that ensures tightness of {fi k }- 

We now prove completeness of M. 
Proposition 15 A4 is complete with respect to Wp' b . 

Proof. Take a sequence p, n that is Cauchy with respect to Wp' b . We first show that {^„} is tight. Fix S > 
and let N be such that ,b (fj, n , fi n +k) < 5 for all n > N, k > 0. Fix the notation flk, Vk to denote the choices 
in (3) realizing the minimizer W p ' b (/ijv, HN+k), and nk the transference plan realizing W p (jlk,^k)- Let Kg 
be such that AiAr(R d \ Kg) < 5, that exists since /ijv has finite mass. Since jik < , then /i/c(R d \ Kg) < 5. 
Define now /j, k the restriction of jik to Kg and v' k the corresponding marginal with respect to -Kk- Using 
Proposition 12, we have that supp(^) C Kg with d = 

As a consequence Dk(Kg) > v' k {Kf) = p' k (Kg) = fik(Kg). Since = />fe(K d ) by construction, 

we have z/fc(IR d \ Kg) < fik(R d \ Kg) < 5. Since a\^N+k — £fc| < W p ' b ([j,n, HN+k) < 8, we have that 
/xjv+fe^ \ Kf) < D k (R d \ Kf) + 8/a < 5 + S/a. Choose S = and K' e = Kf, then p n (R d \ K' £ ) < e 

for n > N . Thus, we have tightness for n > N. Since {fj,i, . . . , (j>n} is a finite family of measures with finite 
mass, then it is tight, hence the whole {/i„} is tight. 

Observe that {(J, n } has also uniformly bounded mass, thus there exists a subsequence fi nk — 1 fx* for a 
certain fj,*, due to Theorem 9. Using Theorem 13, we have that W®' (fi nk , fj,*) — > 0, and by triangular 
inequality we have Wg ' b (p, n , fj,*) ->• 0. □ 

1.4 Comparison with other distances 

In this section, we compare ' b with two distances proposed in the literature, namely the Levy-Prokhorov 
distance (see e.g. [16]) and the distance Wb 2 defined in [8] by A. Figalli and N. Gigli. 

We first recall the definition of the Levy-Prokhorov distance d^p between two probability measures p,, v: 

dLp(n, v) :— inf {a > such that for any closed A it holds (i(A) < v(A a ) + a} , 

where A a is the enlarged set A a := U x i=aB(x, a). It is clear that the two terms v(A a ) and a represent a W p 
perturbation and a L\ perturbation, respectively. For such reason, there are some common ideas between 
d LP and W^ b . 
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The main difference here is that d^p was defined for probability measures only, while we deal with 
measures with different (finite) masses. Nevertheless, even restricting to the space of probability measures, 
the two distances have different values. We study a remarkable case, that is the distance between zz = Sq 
and v = \5_ dl + \&di on the real line. We study the values of d^p and W p ' b as functions of d\,d 2 . For both 
distances, the goal is to choose the optimal coupling, on one side between ^5 and \8_ dl , and on the other 
side between i<5 and ^S d2 . 

We now compare d^p with Wg' b of parameters a = |,£> = 1. Without loss of generality, we assume 
di < d 2 . The set of different cases is the following: 

• The two masses |5-du ^d 2 are both "far" from So. The best choice for both couplings is to focus on 
L\ distance. For the dLP, it means that d\ > 1. In this case c?lp(^, v) = 1. For W p ' b , it means that 
di > f - = 1. In this case W^ b (/z, v) = C (0, 0) = 1. 

• One of the masses is "close" and the other is "far". This means that d\ < 1 < d 2 . In this case, we 
use the W p distance for the coupling |#o, and the L\ distance for the coupling \8o, \5d 2 - Thus 
d LP (n,v) =sup{i,d 1 } andW^i^v) =C(±6o,±6- dl ) = § + 2-VPdi. 



Both masses are "close, but one is not very close". This is true for ^ < d 2 < 1- In this case, the optimal 
strategies are different for the two distances: 

— for dip, we use the W p distance for the coupling ^Sq, \8-d 1 and the L\ distance for the coupling 
\5<3, Thus d LP (p, v) = sup {±, d\). 

- for Wp - b we use the W p distance for both couplings. This gives W£ ,fc (fi, v) = C(/i, v) = W v (jx, v) — 

Both masses are "very close". This is the case of d 2 < \- For both distance the best choice is to use 
the W p distance. Thus d LP {^, v) = d 2 and W%' b (n, v) = C(fj,, v) = W p {^, v) = [^^j 



,,p \ i/p 



It is easy to prove that the two distances dLP, W p ,b are equivalent as norms on the space (d\, d 2 ). Moreover, 
they have the same level lines around d\ — 0, d 2 = when choosing p = oo. 

We now recall the distance Wb 2 defined in [8]. Such distance is defined on a subset with non-empty 
boundary dfl. The idea is that such boundary is an infinite reserve of mass, in the following sense. Given 
two measures /z, v, even with different masses, one can either send a part of mass of /z to v or to the boundary 
dfl. Similarly, the mass of v that does not receive mass from /z goes to the boundary dfl. In both cases, the 
cost is computed via the Wasserstein distance, either from /z to v or from /z to dVl. 

This cost is based on an approach that is rather different than ours. First of all, it deals with a space 
that has boundary, otherwise one could not deal with measures with different masses. Moreover, the cost of 
sending mass to the boundary (that is similar to delete mass in our approach) is computed as a Wasserstein 
distance and not with L\, like in our case. 



1.5 Estimates of generalized Wasserstein distance under flow actions 

In this section, we study properties of W p ' b when restricted to A4q°. In particular, we are interested in 
estimates about the variation of W p - b (/z, v) under action of flows on /z, v. These Gronwall-like properties 
will be useful for the study of solutions of (1). 

We first recall a connection between flows actions on measures and transport equation. Take a Lipschitz 
vector field v, that generates a flow for t > 0. The flow is a diffeomorphism for the space R d , thus one 
can define fi t := $j#^z for a given measure /iq. One has the following theorem. 



11 



Theorem 16 Take a Lipschitz vector field v, and the flow $^ it generates. Given fiQ 6 Mq c , and fi t ■— 
$j#/iO; then [i — /U[o,t] * s the unique solution of the linear transport equation 

(a tl H + v-(v fH ) = o (12) 

in C ([0, T], Mq c ), where Mq c is endowed with the weak topology. 

Proof. The proof is a direct consequence of [16, Thm 5.34]. See details in [11]. □ 
To study (1) in the setting of the generalized Wasserstein distance, it is interesting to check the evolution 
of Wp ' 6 under flow action. We first recall here some properties about standard Wasserstein distance, that we 
proved in [11]. Here, we emphasize the fact that the Wasserstein distance is computed between two measures 
/i, v with same mass, in general different than 1. 

Proposition 17 Given v, w bounded and Lipschitz vector fields of Lipschitz constant L and [i,v £ A4q c , the 
following holds: 

1. W P W#n,* v t #v) < e^ Lt W v {^v), 

2. W p (n,$?#n) <t||t;|| c o/i(R<') 1 /p, 

3. W p (*%#n, ^#u) < e^ u W v (ii, v) + ^(R^Vp '^y- 1 ) ||„ - w || c0 , 
We now prove similar properties for the generalized Wasserstein distance. 

Proposition 18 Given v,w bounded and Lipschitz vector fields of Lipschitz constant L, the following holds: 

1. Wp' b < e*¥ Lt W? h (n, v), 

2. <t||«|| C o/i(R« I ) 1 /p, 

3. W%> b {§ v t #n,§y#v) < e B $ kLt W$< b (//, v) + /i(R d ) 1/p ^'^f'-^ \\ v - w \\ c „. 

Proof. For the first property, take ft, v realizing (3) for W^ h {fi, v) with fi < fi, v < v. Then 

W£' 6 (*?#**,*?#!/) < o|*?#a* - + a|*?#f - + 6Wi,(*?#A, < 

< a\fi - fb\ + a\v -u\+ 6e £ F iLt W p (/i, v) < e £ J iLt (a\fj, - fj\ + a\u -u\ + bW p {fl, u)) . 

The proofs of the second and the third properties are equivalent, based on proofs of Proposition 17 given in 
[11]. ^ ' □ 



2 Existence and uniqueness of solutions of (1) 

In this section we prove the existence and uniqueness of the solution of (1), under the hypothesis (H). The 
key tool is the construction of a candidate solution by sample-and-hold; we then prove that it is indeed a 
solution, and finally prove that it is unique. 



2.1 Construction of an approximated solution 

In this section, we apply the sample-and-hold method to construct a sequence of functions in C ([0, T], Mq c ) 
such that the limit exists and it is a solution of (1). With no loss of generality, we assume that the T = 1. 

We first define an approximated solution /i fc for each fcgN. Given a fixed k, define At :— ^ and consider 
the decomposition of the time interval in [0, At] , [At, 2Ai] , [2 At, 3At] , . . . , [(2 fe - l)At, 2 k At] . 

The idea is based on a Lagrangian scheme with the time-discretization provided above. In particular, we 
define a solution for each time nAt, then consider that v and h are computed at this time and fixed on the 
next interval [nAt, (n + l)Ai). More precisely, for a fixed k, we define 
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r k l 

• fi k := $ T " A #^^ At + r/i [M«At] w ^h n tne maximum integer such that t — nAt > and r := t — nAt. 

We now prove that, given v,h satisfying (H), the sequence \i k is a Cauchy sequence for the space 
{C([0,1],M),V), where 

:= sup % Q < b (/^ t )- 

t€[0,l] 

Since we have proved that (Ai,Wp' b ) is a complete space, then also (C([0, 1] ,M),T>) is complete. Observe 
that we deal with the whole M. to have completeness, and that we will subsequently prove that the limit is 
indeed an element of Mq c . 

We first make three simple observation: 

• At each time t € [0, 1], the mass \i\ (M d ) is bounded. Indeed, first observe that 

^+i)At(R d ) < <&i M " at] #^A t (K d ) + A^K At ](K d )<^A t (K d ) 
As a consequence, we have 

rf(R d ) 1/p < to, with m:= ( Mo (K d ) + ^) 1/p . 
This holds for all k, t. We use such constant in the following estimates. 

• Given a fixed k, the evolution of the scheme satisfies Wp ,b ^fn+ijAf A*nAt) — AiMm + AtP, and 
more in general 

W;> 6 <|i-s|(Mro + P). 

• We will use in the following the estimate 

{^nU 2 )AV ^At) < (/*«S i)At , + % M ^At) < 

< Ai(M m + F) + ^(^,^ At ), (13) 

that is a simple consequence of Proposition 18-2. 

We now estimate the distance Wp- h {j^n+i)Av ' u (n+i)At) w * tn res P ec t to W£ ,h (m^a*' A*nAt) > i- e - tne 
evolution of W^ 1 ' b at the discretization points of the Lagrangian scheme for fi k . 
We use the following compact notations: 



VI := $Xo AtJ Ul := /i 



-At/2 

We have 



VmAt 



W p' b {^ k (n+l)At^(n+l)Atj ~ 

< W a, b ^+1 #v*+ 1 # / i^, vi#v*#/x* At ) + w;< 6 (v^+J#w* +1 ,w*) + (n k+1 ,n k ) 
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We estimate the three parts using Proposition 18. Estimates will be given for sufficiently large k, i.e. for 
sufficiently small At. Moreover, to simplify the notation, we write estimates that hold for all p > 1. We 
estimate the first term as follows: 

w P b (vSJ#^ +1 #^.^# v i#^A t ) < 

< (1 + 2LAt)W»> h (v£ +1 #mJ&, V k #» k nAt ) + m(l + LAt)At\\v [m?„ + +i )a J ~ v [n k nAt ] \\ c ° < 

< (1 + 2LAt) ((1 + 2LAt)W?' b » k nAt ) + mAt(l + LAt)\\v [^+ A \] - v [» k nAt ] || c o) + 
+mAt(l + LAt)N (f(Mm + P) + W^ b (/^UnA*)) < 

< (1 + (5L + AmN)At)w;* (/^UnA*) + m7V(Mm + P)At 2 . 

The second term is estimated as follows, using (13): 

W p" (ylXXm^^r) < K'" (t + i#^ +1 ,«n +1 ) + W? h ,H k ) < #MP + QW^ b (^, ^ At ) 

The third term is simply estimated by W^ h (H k n +1 ,U k ) < QW^ h (/J. k A \, /J. k At ) ■ Summing up, we have 
K'" U^l)At^(n+l)At) < (l + CiAiJW^M&U^+CiAt 2 



with Ci := 5L + 4mA + Q, C2 := mN(Mm + P) + ^1L, both independent on fc, n. Applying it recursively 
in n and recalling that W^ h (/Xq +1 ,/x§) = 0, we have 

WZ* {» k n + A \,ti At ) < Ca A 1 ^^ ^ 1 <2nC 2 At\ (14) 

We use such estimate to prove the convergence of [i k for each t e [0, 1]. We study the three following 
cases: 

t = 1: it corresponds to n = At^ 1 . Applying the triangular inequality and (14), we have 

W^{, k ,^) < 2C 2 (^ + ^ + ... + ^)<^. 

Thus, Hi is a Cauchy sequence in a complete space, hence it is convergent. 

t = -§r for some r, I integers: Consider [i k starting from k = I and apply estimates similar to the previous 
case. 

any t G [0, 1]: For each k, let be the maximum integer such that t — nkAt > 0. We have 

2- k (Mm + P) + W;- h (/i* fc2 -», A*^-*) + ^ k (Mm + P). 



< 



The first and third term are converging with respect to k, uniformly in I. The same holds for the 
second term, as proved at the previous step. 
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2.2 Existence and uniqueness of the solution 

In this section we prove that the measure fl t '■= hm^ n k given as a limit of the sequence of the Lagrangian 
scheme is indeed a weak solution of (1). We will then prove that it is unique. 

First observe that p, t exists, since it is the limit of a Cauchy sequence in (M, Wp' b ), that is complete. It is 
evident that, since /Zq = /x for all k, then p = fj, . We have now to prove that, for each / 6 C£°((0, 1) x R d ), 
it holds 



f dt f (dfit (d t f + v [[i t } • v/) + dh [fi t } /) = o 

JO JM d 



(15) 



The idea is to use again u fe . First of all, we recall the solution of the continuity equation with time- 
independent v or ft and with the other part (ft or v respectively) being identically zero. For h = 0, the 
solution of (1) with time-independent v is \i t = § v t #^o- For v = 0, the solution of (1) with time-independent 
ft is fit = Ho + 1 ft. This gives 

2 fc -l f ( n +l)At 



]T / dt d/i* (dtf + V [/X„At] • V/) + dh [fJ-nAt] f = 

n=0 JnAt JR d 

' fc - 1 An+l)At p / / r k 1 \ 

2 / dt / d *y„ B £ J #MnA t + (* " nAt)fe ($/ + « [^ At ] • V/) + dh [» k nAt ] f ) = 



21 /-(n+l)At f / T * 1 \ 

E / d * / d < [ ~n"At l #^At (ft / + « • V/) + 

„ =0 JnAt JR d \ / 

2k - 1 n(n+l)At / r f 

+ E / dt W- nAt ) / dh ifnAt] dtf + / dh f ) + 

„ =0 JnAt V JR d JR d 

2 fc -l ,(n+l)At 

+ ^ / 

n=0 JnA * 



dt(f - nAt) f dh [» k nAt ] v [fJ-nAt\ ■ V/ = 

' nn+l)At 

+ 0+^ / - nA *) / ^ [^nAt] « [MnAt] ' V/- 

„_ n JnAt JR d 



Due to the boundedness of ft, w, V/, it exists C3 such that 



< C3, that gives 



2 - 1 p(n+l)At 

lim J] / df(* - nAt) / dft [^ At ] v [» k At ] ■ V/ 



n=0 



nAt 



< 



2*-l .(„+l)At a,2 

< lim C 3 V / dt(t - nAt) = C 3 2 k = 



Going back to prove (15), we prove equivalently that 
lim fe 



[ dt [ (dfit (dtf + v [fit] • V/) + dh [fit] f) + 

Jo J& d 

2 fc -l ,( n +l)At /■ 

/ dt / d/x* (5 t / + w [^At] • V/) + dft [//*At] / 

* * JR d 



n=0 



nAt 



0. 
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Since lim fe W° ' b (/^,/z t ) = implies ^ — ^ [i t (Theorem 13), then 



2 e -l 



" _i /-(n+l)At <■ C 1 f 

dt dtfd t f = dt dfitdtf. 

n=0 JnAt JR d Jo JR d 



lim 

fc 



The same applies to h, for which we have lim*, Wff' b (h [/if] , /i [/2 t ]) = 0, thus 

-' I- An+l)At 
n=0 

We are now left to prove 

2 fc -l r (n+l)At 



lim V / dt / dh [rf] f= dt dh [jit] f. 

k „-n JnAt J«. d Jo JR d 



lim 

fc 



pi ,■ z —L ,.(n+l)At ,■ 

\ dt / dfi t V [fl t } ■ V/ - V / dt dtfv [fl-nAt] ■ V/ 

Jo JR d n=0 JnAt JVL d 



0. 



Using the triangular inequality, we prove it by proving the three following: 



• linifc 



0. Since /i t is continuous and w is Lipschitz with 



respect to its argument and bounded, then v [fit] ■ V/ G C£°((0, 1) x thus /if — 1 /t t implies the 
result. 



• lim* 



E to 1 Ji:t 1)At * ^ t fe (« W - v [tA] ) ■ v/ 



= 0. Since it exists a constant C4 such that 



Wp' b (/if, /it) < 2 fe C4, then the previous limit can be proved by recalling that /if has finite mass 



smaller than m and observing that V/ is a bounded function. Thus 

■ r 

dt / d^ t (v\fi t ]-v[^])-Vf 

jR d 



lim 

k 



2-1 ,(n+l)At 
„=0 JnAt 



< lim 

fc 



/ dtm2- k NC 4 max (V/) 

7o [0,1] XR- 



= 0. 



• lim* 



e to 1 /iir )At * ^ (v m v ^ nAt ] ) . v/ 



0. The proof is similar to the previous 



case, recalling that \\v [/if] - v [fJ- nAt ] \\c < (* _ nAt)(Mm + P). 

We have thus proved the first part of the following proposition 

Proposition 19 The measure ft t ■— limfc /if is a weak solution of (1). Moreover, p, t € Mq c . 

Proof. We have to prove that fit G A4q c . Define the non-autonomous vector field v t := v [fit] and the time- 
dipendent measure h t := /i[/it]. Recall that fit is continuous with respect to time, thus v t is a continuous 
vector field with respect to time, each h t is absolutely continuous and h t is continuous with respect to 
time. The corresponding unique solution of (1) is thus in Mq c , see Theorem reft-misuretrasporto. Hence, 
fneM% c . □ 
We now prove that we have continuous dependence of the solution of (1) from the initial data. The 
estimate also give uniqueness of the solution of (1). 

Theorem 20 Let \i t ,v t be two solutions of (1), with initial data fJ,o,vo G M% c respectively. Let v,h satisfy 
(H). Then 



In particular, under assumption (H), the solution of (1) is unique. 



(16) 



16 



Proof. The key observation is that a solution of (1) at time t+s is of the form fi t +s = &s #(M+s h\ptt]+o(s). 
Also observe that the mass of each solution satisfies /U t (R d ) < m>(M. d )+tP < mP , with m := {p, (R d ) + P) 1/p . 

Take now v,h satisfying (H), and \i t ,v t two solutions of (1), with initial data fio,^o respectively (even- 
tually coinciding). Define e(t) := W p ,b {fit, ft) an d observe that it is a Lipschitz function, since 

e(t + s)-e(t) = W? b (iH +a ,v t+s )-W? b (iH,» t )< 

< W;- b (» t+ s,»t) + W a p b (fi t , v t ) + W;- b {ut,v t+8 ) - W a p b ( M t, v t ) < 2s(Mm + P) 

Now observe the following 

e(t + s) < ($: W #ft,$: W #^) + + o(«) < (l + z±±L S + o( S ))w;> b (nt,vt) + 

+/i t (M d ) 1 /P(l + Ls/p + o(s))(s + o{s))NW;' b (fi t , v t ) + sQW^ b (fi u v t ) + o(s) < 
< e(*) + s (£±1L + 2to7V + Q) e(t) + o{s) < e(t) + s (t±±L + 2mN + Q + l) e(t). 

The last estimate holds for sufficiently small s. Using the integral form of the Gronwall inequality we have 
(16). The uniqueness of the solution of (1) is a direct consequence. □ 
Acknowledgments: This work was conducted during a visit of F. Rossi to Rutgers University, Camden, 
NJ, USA. He thanks the institution for its hospitality. 
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